# Deep Reinforcement Learning

Poca SoccerTwos
A deep reinforcement learning agent trained with Unity ML-Agents, specifically designed for two-player soccer game scenarios.
Object Detection
P
honestlyanubhav
118
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.
Physics Model
P
sofiascat
14
1
Ppo Huggy
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for the Huggy game.
Multimodal Fusion TensorBoard
P
alex17127
75
1
Ppo Huggy
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed to run the Huggy Game.
Multimodal Fusion TensorBoard
P
ErenDoymus
30
1
Ppo Huggy
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed to control the behavior of the virtual dog Huggy.
Object Detection TensorBoard
P
hellonihao
52
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.
Physics Model
P
tooalvin
13
1
Ppo Huggy
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for reinforcement learning tasks in the Huggy game.
Multimodal Fusion TensorBoard
P
PampX
16
2
Poca SoccerTwos
A deep reinforcement learning model trained with Unity ML-Agents, specifically designed for two-player soccer game scenarios
Molecular Model TensorBoard
P
hishamcse
20
1
Mlunitypyramids
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for gaming in pyramid environments.
Multimodal Fusion TensorBoard
M
motmono
21
0
Test Worm
This is a reinforcement learning agent based on the PPO algorithm, specifically trained for Unity's worm game.
Image Generation TensorBoard
T
damilare-akin
15
0
Testworm
A reinforcement learning agent based on the PPO algorithm, specifically trained to play the Snake game
Image Generation TensorBoard
T
curt-tigges
85
0
Mlagents Worm
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for the Worm game.
Multimodal Fusion TensorBoard
M
danieladejumo
17
0
Testpushblock
A deep reinforcement learning agent trained using the PPO algorithm for Unity's PushBlock game environment
Molecular Model TensorBoard
T
rebolforces
30
0
Pushblock
This is a reinforcement learning agent based on the PPO algorithm, specifically trained to complete tasks in Unity's PushBlock environment.
Multimodal Fusion TensorBoard
P
mrm8488
35
0
Worm Unity ML
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for the Worm game environment.
Molecular Model TensorBoard
W
comodoro
14
0
Pyramidsrnd
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for gaming and decision-making in the pyramid environment.
Object Detection TensorBoard
P
mrm8488
25
1
Unitypyramidsrnd
This is a reinforcement learning agent based on the PPO algorithm, specifically trained for Unity's ML-Agents pyramid environment.
Object Detection TensorBoard
U
jakka
15
0
Testpyramidsrnd2
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed to run the Pyramid game.
Object Detection TensorBoard
T
micheljperez
16
0
Mlagents Worm
This is a PPO agent model trained based on Unity ML-Agents, specifically designed for the Worm game environment.
Object Detection TensorBoard
M
infinitejoy
19
0
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.
Physics Model
P
sigalaz
20
0
Unity Pyramids
This is a PPO agent model trained with Unity ML-Agents, specifically designed for the pyramid game environment
Object Detection TensorBoard
U
ra-XOr
23
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to control the safe landing of a lunar lander.
Physics Model
P
andri
16
0
Sac Hopper V3
This is a reinforcement learning model based on the SAC algorithm, designed to control robot hopping movements in the Hopper-v3 environment.
Physics Model
S
sb3
44
0
Sac Walker2d V3
This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.
Physics Model
S
sb3
43
0
Assignment2 Omar
This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.
Physics Model
A
Classroom-workshop
135
3
Td3 MountainCarContinuous V0
A TD3 reinforcement learning agent trained based on the stable-baselines3 library, specifically designed for the MountainCarContinuous-v0 environment.
Physics Model
T
sb3
203
0
Td3 HalfCheetah V3
This is a TD3 reinforcement learning agent trained using the stable-baselines3 library, specifically designed for the HalfCheetah-v3 environment, achieving an average reward of 9709.01.
Physics Model
T
sb3
23
0
Sac Pendulum V1
This is a reinforcement learning model based on the SAC algorithm, designed to solve control problems in the Pendulum-v1 environment.
Physics Model
S
sb3
39
0
Ppo Huggy
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for the Huggy game.
Image Generation TensorBoard
P
ThomasSimonini
28
2
Ball Test
A reinforcement learning agent based on the PPO algorithm, designed to control the balancing ball task in the Unity 3DBall environment
3D Vision TensorBoard
B
osanseviero
29
0
Ball
This is a reinforcement learning agent trained with the PPO algorithm, designed to control the balancing ball task in the Unity 3DBall game.
3D Vision TensorBoard
B
ThomasSimonini
23
0
Ppo SeaquestNoFrameskip V4
This is a PPO agent model trained using the stable-baselines3 library, specifically designed to play the Atari game SeaquestNoFrameskip-v4.
Video Processing
P
ThomasSimonini
205
0
Ppo BreakoutNoFrameskip V4
A deep reinforcement learning model trained using the PPO algorithm in the Atari Breakout environment
Video Processing
P
ThomasSimonini
459
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase